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Abstract — This paper investigates the distributed power al- 
location problem for coordinated multipoint (CoMP) transmis- 
sions in distributed antenna systems (DAS). Traditional duality- 
based optimization techniques cannot be directly applied to 
this problem, because the non-strict concavity of the CoMP 
transmission's achievable rate with respect to the transmission 
power induces that the local power allocation subproblems have 
non-unique optimum solutions. We propose a distributed power 
allocation algorithm to resolve this non-strict concavity difficulty. 
This algorithm only requires local information exchange among 
neighboring base stations serving the same user, and is thus 
flexible with respect to network size and topology. The step- 
size parameters of this algorithm are determined by only local 
user access relationship (i.e., the number of users served by each 
antenna), but do not rely on channel coefficients. Therefore, the 
convergence speed of this algorithm is quite robust to channel 
fading. We rigorously prove that this algorithm converges to an 
optimum solution of the power allocation problem. Simulation 
results are presented to demonstrate the effectiveness of the 
proposed power allocation algorithm. 

Index Terms — Coordinated multipoint transmission, distribu- 
ted power allocation, distributed antenna system. 



I. Introduction 

The explosive growth of mobile access services has led 
to a huge demand for enhanced throughput and extended 
coverage in the next generation wireless networks. In recent 
years, distributed antenna system (DAS) has emerged as a 
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promising network architecture to achieve these goals [fl]-|[3]. 
In this architecture, each base station is equipped with some 
remote antennas which are distributed in the entire cell area, as 
shown in Fig. [TJ These distributed antennas are connected to 
the base station via wired backhaul network. By this, nearby 
distributed antennas are able to coordinate with each other 
and provide enhanced service experience to the mobile users. 
This technique is called the coordination multipoint (CoMP) 
transmission in the literature 0, H. 

One of the key techniques to realize high throughput in 
wireless networks is power allocation. Traditionally, power 
allocation of wireless networks is handled by centralized 
algorithms, e.g., JSJ — BSJ - These algorithms request multi-hop 
signaling mechanisms to gather the channel state informa- 
tion (CSI) of all the wireless links at a central processing 
unit in a short time period, and then distribute the obtained 
power allocation solution to the transmitters. Such mechanisms 
would generate enormous signaling overhead on the backhaul 
network, and is probably not scalable when the network size 
grows large. 

Recently, a great deal of research efforts have focused on 
distributed power allocation for various wireless networks. 
Game theory based power allocation techniques were pro- 
posed in j9l- lfT3l . which intend to compute Nash equilibrium 
power allocation solutions. However, these Nash equilibrium 
solutions might be far from optimality |[T2| . Duality -based 
distributed power allocation techniques were proposed in [fT4l - 
|[T6l , where the global power allocation problem is decom- 
posed into many local power allocation subproblems, each of 
which can be solved by utilizing only locally available network 
information. However, these techniques cannot be directly 
applied to CoMP transmissions in DAS — the local power 
allocation subproblem may have many optimum solutions, 
because the data rate of CoMP transmission is not strictly 
concave with respect to the power variables 11141 . ifTTI . Since 
no global network information is available when solving the 
local power allocation subproblems, it is quite difficult to 
find a global feasible solution among all the locally optimum 
solutions. 

One promising method to address this non-strict concavity 
problem is the proximal point method [18], which adds strictly 
concave terms to the objective function without affecting the 
optimum solution. However, typical proximal point algorithms 
require a two-layer nested iteration structure, where each 
outer-layer update can proceed only after the inner-layer itera- 
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Fig. 1. System model of coordination multipoint (CoMP) transmissions in 
a distributed antenna system (DAS). 

tions converge flT"8l . Such a structure is not suitable for on-line 
distributed implementation, because it is difficult to decide in 
a distributed manner when the inner-layer iterations can stop. 
In Ifl9ll , a single-layer proximal point algorithm was proposed 
for multi-path routing problems. However, the convergence 
analysis in |[T9l also cannot be directly utilized for the power 
allocation problem considered here, owing to the additional 
channel coefficients in our problem. It is difficult to answer 
whether the channel coefficients have significant impact on the 
convergence behavior of the algorithm mentioned above. 

This paper investigates the distributed power allocation 
problem for a downlink DAS with many antennas and many 
single antenna users. Each user is served by several nearby 
antennas via CoMP transmission techniques. Meanwhile, each 
antenna may serve several users over orthogonal channels. The 
main contributions of this paper are summarized as follows: 

1) A distributed power allocation algorithm is proposed to 
maximize the weighted sum rate of the downlink DAS, 
subject to per-antenna power consumption constraints. 
This power allocation algorithm is implemented distribut- 
edly among the base stations instead of being executed 
in a centralized fashion. The algorithm possesses a nice 
single-layer iteration structure, which is desirable for on- 
line implementations. In each iteration, the algorithm only 
requires local information exchange among neighboring 
base stations serving the same user, which is flexible with 
respect to network size and topology. 

2) A novel procedure is proposed to compute the primal op- 
timum solution of the local power allocation subproblem, 
which is simpler than that proposed in |[T9l . 

3) We rigorously analyze the convergence and optimality of 
the proposed distributed algorithm for the power alloca- 
tion problem. The bounds of the step-size parameters to 
ensure convergence are derived. 

4) We show that the step-size parameters of this algorithm 
are determined by only local user access relationship (i.e., 
the number of users served by each antenna), but do not 
rely on channel coefficients^- Therefore, the convergence 

'The channel coefficients are only utilized locally to solve the local power 
allocation subproblem. 



speed of this algorithm is quite robust to channel fading. 

Our proposed power allocation algorithm is motivated by 
the work of |[T9l . However, our work differs from it in several 
respects. First, our analysis indicates that a larger step-size 
can be used for the algorithm in |[T9l , which can achieve 
a faster convergence speed. Second, while our problem has 
additional channel fading coefficients, we show that the step- 
size parameters and the convergence speed of our algorithm 
are robust to different channel fading coefficients. Finally, our 
procedure for solving the local power allocation problem is 
simpler than that proposed in |fl9l . 

For ease of later use, we define the following notations: Let 
IS* | denote the number of elements in set S, and let S/T denote 
the set S/T = {x\x 6 S,x ^ T}. The projection of a real 
number x on the set [0, oo) is defined as [x] + = max{x, 0}. 

The remaining parts of this paper are organized as follows: 
In Section [ill the system model and problem formulation are 
presented. Section [III] presents the proposed power allocation 
algorithm and its distributed implementation. Simulation re- 
sults of the proposed power allocation strategy are presented 
in Section IV. Conclusions are drawn in Section V. 

II. System Model and Problem Formulation 

A. System Model 

We consider a downlink DAS with K distributed antennas 
and N single antenna mobile users, which are denoted by K, = 
{1,2,- •• ,K} and Af = {1,2,- •• ,N}, respectively. Each 
base station is equipped with several distributed antennas, as 
illustrated in Fig. [T] These distributed antennas are connected 
to the base station via wired backhaul network. The total 
throughput of this network is limited by the strong co-channel 
interference. By allowing several nearby antennas to transmit 
to one user in a coordinated fashion, the CoMP transmission 
techniques, such as space-time block coding or maximum 
ratio transmission |3|. convert the strong interferences into 
useful signals and thereby significantly boost the total network 
throughput. The set of antennas serving the ?ith user is denoted 
by B,(n) C /C, and the set of users communicating with the kth 
antenna is expressed as U(k) C J\f. In practice, the number of 
serving antennas for each user, i.e., is usually small, 

due to the limitation of implementation complexity for CoMP 
transmissions. 

When the density of the distributed antennas is high, CoMP 
transmissions can not mitigate all the strong interferences, 
which results in some strong residual interferences. In lEOl . 
it was shown that orthogonal transmission is Pareto optimal 
for strong interference Gaussian channels. Therefore, the users 
with strong mutual interference should be scheduled to com- 
municate over orthogonal channels via frequency (or time) 
division multiple access, while geographically separated users 
with weak mutual interference are allowed to share the same 
channel resource. This scheduling task belongs to the type 
of timetabling problem, which is a classic problem in the 
computer science literature with many practical algorithms 

ed, ma. 

After selecting proper antennas for CoMP transmission and 
scheduling the users, there are only weak interferences in the 



ACCEPTED BY IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, FEB. 2013 



3 



network. We consider a slow fading wireless environment. Let 
hkn be the complex coefficient of the wireless channel from 
the fcth antenna to the nth user and pk n be the transmission 
power of the fcth antenna for serving the nth user. The data 
rate of CoMP transmission to the nth user is given by 



DAS: 



N 



C„ = log 2 1 



0-2 + Z)(k,m)eJ(n) l^ fc "l P kr > 



, (1) 



where I(n) is the set of source antenna and serving user 
pair which may interfere the nth user, or more specifically, 
(k, m) £ I(n) represents that the fc-th source antenna serving 
the ?n-th user through the same serving channel of the nth 
user. 

There are two difficulties for utilizing the data rate function 
C„ to formulate the power allocation problem: First, it leads 
to a non-convex optimization problem that is NP-hard ||231 . 
for which one may not be able to find a solution that is both 
fast and global optimal even by centralized optimization. The 
design of a distributed optimization algorithm will be even 
more difficult, if not impossible. In order to reduce the solution 
complexity, we need to find an approximate rate function of 
C„ that is convex. Second, it can be quite difficult to attain 
the exact expression of the rate function C„. In practice, the 
number of interfering antennas is usually much larger than 
the number of source antennas. Although the receiver can 
get an accurate estimation of the channel gain \hkn\ 2 for 
each source antenna k 6 R(n), it may be too demanding 
to estimate the channel gain from the enormous interfering 
antennas, especially when the powers of the interference 
signals are weak. On the other hand, estimating the noise- 
plus-interference power a 2 = a 2 + Yl(k,m)el(.n) \hkn\ 2 Pkm 
is obviously much easier. For these reasons, we consider to 
utilize an upper bound of the noise-plus-interference power 
ofj, which is denoted by cr^ peak , to derive an approximate 
rate function. Let 7fc„ = \hkn\ l°n peak denote the normalized 
channel gain from the fcth antenna to the nth user. Then, we 
derive a conservative rate function 



Pkn 
S.t. 



max ^ W n l0g 2 1 + ^ Pknlkn \ 

n=l \ keR(n) 

Pkn<Pk, k = l,2,...,K, 

neU(k) 

p kn >0, k = l,2,...,K, neU(k), 

where w n > is the weight of the nth user's data rate and 
Pk is the maximal allowable transmission power of the fcth 
antenna. 

This problem is a convex optimization problem, which can 
be solved by standard centralized convex optimization algo- 
rithms such as the interior point method 11241 . However, these 
centralized algorithms are hard to be fulfilled in large-scale 
DAS, due to the heavy signaling overhead over the backhaul 
network. In contrast, duality-based optimization techniques 
lfT4l - lfT6l cannot be directly applied to this problem, either, be- 
cause they require the objective function to be strictly concave. 
However, the objective function in (fj) is not strictly concave 
with respect to the transmission power variables, since it is 
constant when the value of YlkERM Pknlkn is fixed. If the 
duality-based optimization techniques lfT4l - lfT6l are utilized, 
the decomposed local power allocation subproblem may have 
many locally optimum solutions at some special dual points. 
It is quite difficult to recover a global feasible solution among 
all the locally optimum solutions. When the dual variables are 
updated around these dual points, the primal power allocation 
variables keep oscillating and hardly converge (see lTl9l for 
more details). 

III. Distributed Power Allocation Algorithm 

In this section, we propose a power allocation algorithm to 
solve the problem (f3]l, which is distributed among the base sta- 
tions instead of being centralized over the entire network. The 
key feature of this algorithm is that its step-size parameters 
and convergence speed are robust to different channel fading 
coefficients, which makes our algorithm quite convenient for 
practical implementations. The details are provided in the 
following subsections. 



C„ = log 2 1 



^ Pknlkn 
keR(n) 



<C„. 



(2) 



The key benefit of the conservative rate function C„ is that 
it is convex and is computable without accurate knowledge of 
the channel gain \hk n \ 2 for the enormous interfering antennas, 
which resolves the two difficulties mentioned above. We will 
illustrate the rate loss for using this conservative rate function 
to formulate the power allocation problem in Section HVl 



B. Problem Formulation 

The rest of this paper focuses on the following power 
allocation problem to maximize the weighted sum rate of the 



A. Single-layer Distributed Power Allocation Algorithm 

To circumvent the aforementioned oscillation problem, we 
make use of the idea in the proximal point method lfT8l . 
which is to add some quadratic terms to the objective function 
and make it strictly concave in the primal variables. We 
reformulate the original power allocation problem (fj} as 



max 



N 

E 

n = l 



w n log 2 (l + 



k£R(n) 



Pknlkn) 



N 



E E -^(Pkn-Vkn) 2 
n=l k£R(n) 



(4) 



s.t. Yl Pkn<Pk, k = 1,2,..., K, (5) 

n£U(k) 

Pkn >0, k = l,2,...,K, n€U(k), 
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where we have introduced some quadratic auxiliary terms to 
make the objective function strictly concave with respect to 
the transmission power variables. Here, y kn is the auxiliary 
variable corresponding to p kn , c n > is the parameter of 
the quadratic terms. For notational convenience, let us use 
the |-R(n)| dimensional vector p n to denote the transmission 
power variables of the antennas serving the nth users, and 
the Yln=i \R( n )\ dimensional vector p = [p[ ,p% , • • • ,Pn] T 
to denote all the transmission power variables. Similarly, we 
define the \R{n) \ dimensional vector y n and the 
dimensional vector y = [iff \y% ', • • • ,y^] T as the auxiliary 
variable vectors corresponding to p n and p. It is known that 
the optimum value of the objective function in coincides 
with that in (0 1181 . In particular, if p* is the optimum solution 
to ©, then p = p*,y = p* solves ©. 

The standard proximal point method in general has a two- 
layer nested optimization structure: the inner layer iterations 
optimizing p for fixed y by a Lagrangian dual optimization 
method, and the outer layer iterations optimizing the auxiliary 
variable y. Such a layered structure is not suitable for on-line 
distributed implementations, because it is difficult to decide 
in a distributed manner when the inner-layer iterations have 
converged. In the following, we present a modified proximal 
point method with a single-layer optimization structure, where 
the outer-layer update of y does not request that the inner-layer 
dual updates have converged. 

The Lagrangian of the problem dU can be written as: 



L(p,\,y) 



N 

E 

n=l 



log 2 (l 



/ J Pknlkn) 
keR(n) 



K 



E Afc ( E Pkn ~ Pfc ) 
k=l n£U(k) 



N 



E E -J^Pkn - Vknf , 



(6) 



n=\ keR(n) 



where A = [A 1; A 2 , • ■ ■ , Xk] T is the vector of dual variables 
corresponding to the constraints in (|5}- Now we are able 
to present our distributed power allocation algorithm as the 
following: 

Algorithm A: Single-layer Distributed Power Allocation 
Algorithm 

At the ith iteration, 

Step 1: Dual variable update: 

Let y = y(t) and A = X(t), maximize L(p,X,y) with 

respect to p: 

p(t) = aigmax^ L(p, X(t),y(t)). (7) 
Update the dual variables by 

X k (t+l) = [X k (t)+a k ( ]T Pkn-Pk)} + , (8) 

n£U(k) 

where a k is the step-size of the dual update. 
Step 2: Auxiliary variable update: 

Let y = y(t) and A = X(t + 1), maximize L(p, A, y) with 
respect to p: 

z(t) = arg maxp> L(p, X(t + 1), y(t)). (9) 



Update the auxiliary variables by 

Vkn(t + 1) = Vkn{t) + P(z k n{t) ~ Vkn{t)), (10) 

where < f3 < 1 is the step-size for auxiliary variable 
update. 

The value of f3 can be chosen arbitrarily in (0,1]. The 
choices of a k to ensure convergence of Algorithm A will be 
discussed in Section [TH-CI We note that while the convergence 
analysis in Ifl9l apply for the degenerated case of ^j kn = 1, 
it is difficult to answer if practical channel coefficients ^ kn 
would have significant impact on the convergence behavior of 
the algorithm. One major contribution of this paper is to show 
that the step-size parameters <x k to ensure convergence are 
irrelevant of ^ kn (see Section IIII-Cb . Since the convergence 
speed of iterative optimization algorithms is mainly affected 
by the step-size, the convergence speed of our algorithm is 
quite robust to different values of j kn . We will also show 
that step-sizes larger than those of |fl9l can be utilized in our 
algorithm to achieve a faster convergence speed. 



B. Distributed Implementation of Algorithm A 

We proceed to explain how to implement Algorithm A in 
a distributed fashion. The Lagrangian maximization problems 
(0 and (O can be decomposed into many independent local 
power allocation subproblems for each user. Specifically, the 
terms of the Lagrangian © can be reassembled as 



Lip, X,y)=^2 



w, 



l0g 2 (l + ^ Pknlkn) ~ 2J ^kPkn 
keR(n) fcei?.(n) 

K 

^A fc P fe . (11) 



Vkn) 



k€R(n) 



k=l 



Therefore, the Lagrangian maximization problems (|7) and © 
can be rewritten as 



N K 

maxL(p,X,y n ) = V" max B n (p n , A, y n ) + } X k P k , (12) 

^r>n — ' r> >o * — * 
n=l y - fc=l 



p>0 

where 



B n {p n , X,iJ n ) =W„l0g 2 (l + 2_j Pknlkn)- ^ X k p kn 

keR(n) keR(n) 

~ E -^(.Pkn-Vkn) 2 - (13) 

Therefore, problems ([7) and (|9} can be decomposed into a 
series of local power allocation subproblems. 

In practice, the resource allocation of a user is carried out at 
a nearby base station. However, the antennas serving this user 
may belong to several base stations, as illustrated in Fig. Q] 
Therefore, neighboring base stations need to exchange infor- 
mation during the iterations of Algorithm A. The distributed 
implementation procedure of Algorithm A is described as 
follows: 

At the tth iteration of Algorithm A, the base station 
assigned to the nth user first utilizes the channel quality 
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information {lkn}keR(n) to solve a subproblem of (0, given 
by 

{Pkn(t)}keR(n) = Bxgmaxp n > Q B n (jp n , X(t),y n (t)), (14) 

and forwards the power allocation solutions {pkn(~t)} keRM 
to nearby base stations controlling the antennas k G R(n). 
Then, the base station controlling the fcth antenna utilizes the 
power allocation solutions {pkn(t)} n eu(k) to update the dual 
variable Afc(i + 1) according to dS), and sends Xk(t + 1) to the 
base station assigned to the nth user. Next, the base station 
assigned to the nth user solves a subproblem of ©, i.e., 

{zkn(t)}k£R( n ) = argmaXj ?n > .B„(p n , X(t + 1), y„(t)), 

(15) 

and utilizes the resultant solution {zk n {t)}keR(n) to update the 
auxiliary variables yk n (t + 1) according to ([Tol l. Therefore, 
Algorithm A can be implemented in a totally distributed 
fashion, and it only requires local exchange of the power 
allocation solution Pkn(t) and the dual variable Xk(t + 1) 
among neighboring base stations in each iteration. In addition, 
when the channel power gain -fkn changes, each user sends 
the updated channel power gain to its assigned base station. 

1 ) Solution to Local Power Allocation Subproblem (I141) : 
The optimum solution to ( fT~4-b satisfies the following Karush- 
Kuhn-Tucker (KKT) conditions (21: 

OB 

n w nlkn a 

dp kn In 2(1 + J2keR(n) Pknlkn) 

= 0, if p kn > 0; 
<0, ifp fc „ = 0, 



CnijPkn ~ ykn) 



Define 



fi(n) = {k G R(n)\p kn > 0} 



V k G R(n). 
(16) 

(17) 



as the set of antennas serving the nth user with positive power. 
Hence, pk n = for all k G R(n)/Cl(n). If O(n) is known, 
the KKT conditions in (TBI) indicate 

Wnlkn , / n n 

i on j ^ r - Afc - C n (p k n ~ ykn) = 0, 

ln2(l+^ feen(n) Pknlkn) 

V k e O(n). (18) 

By conducting a weighted summation of the equations in dT8l ), 
we obtain an equation of Efc G n(n) Pknlkn, i-e., 

Ea.-gJ2(^ 



T / IknXk ~ C„ IknPkn 



AGfi(n) 
fc£0(ra 



Aef2(n) 
7fen2/fcn = 0. (19) 



Let us define s„ = Efc £ n(n) Pknlkn, then dT9j can be 
reformulated as 



CnSn (Cn f^n)^n ~t~ 7ri 



0. 



(20) 



where 7 „ = E fee n(n) W »7L/ ln2 < M« = Efeenf„) W** - 
c n ykn)- The root s„ of the quadratic equation (|20b is given by 

s n = 5— [~(c„ + Mn) + V (c„ + ^„) 2 - 4c„(^„ - t~)]. 

(21) 



Substituting d2TT > into d!8t . we obtain the optimum solution to 
the subproblem ( fT4l as 



PA- 




A* 



if k G fl(n) 



ln2(l+s„) 

if fc G i?(n)/fi(n). 

(22) 

2) A Novel Procedure to Derive f2(n): Until now, the left 
task is to determine f2(n) in the optimum solution to (fT~4T >. Let 
us consider the unconstrained problem corresponding to ( [Pfl i. 
i.e., 

p n ,o = argmax5 n (p„, A, y„). (23) 



Our research indicates that if pkn.o in the solution to 
satisfies pkn.o < 0, then the solution to (fT~4T > must satisfy pfe„ = 
(i.e., k 51(n)). This statement is expressed in the following 
lemma: 

Lemma 1 Suppose that g(x) is a differentiable concave 
function on x G [0, oo), and B{p) is defined as B(p) = 

9(J2kPklk) - J2k( a kPk + b kPk + Cfc) With Cl k > 0. // 

{p* k } = argmaxp.> i?(p) and {p k ,o} = argmax^B(p7, then 
p* k = /or any k satisfying pk t a < 0. 

Proof: See Appendix A. ■ 
With Lemma Q] we are able to compute the optimum choice 
of fi(n). The detailed procedure is given as follows: 
P-l. Initialization: Set Q(n) = R(n). 

P-2. Compute s„ and pk n according to d2"T1 i and (l22l . respec- 
tively. 

P-3. If pfe„ > for all fc G Jl(n), output J7(n) and exit; 
otherwise, set Cl/ n ) = {k\pk n > 0, fc G i?(n)} and return 
to P-2. 

Remark 1: Lemma Q] allows us to rule out all the elements 
k with pkn < from D,(n) in one iteration. Therefore, the 
proposed procedure can converge much faster than the method 
proposed in |fl9l , l25l . which is to eliminate only one element 
k with the smallest negative p kn in each iteration. Our method 
significantly reduces the number of iterations to compute S7(n) 
and does not require sorting procedure of {p kn }- 

C. Convergence Analysis 

In this subsection, we obtain the bounds on the step-sizes 
at to ensure convergence. First, we need some notations 
and definitions to simplify the expressions of our theoretical 
analysis. Let us consider the function 

= f E^=l W n l0g 2 (l + J2keR(n) Pknlkn), if Pkn > 0, 



f(P) 

-oo, otherwise, 

(24) 

which has incorporated the power constraint p > in the 
definition. The analysis of this subsection applies to any 
objective function f(p) in the form of }~2 n /n(EfePfe«7fen) 
with /„ {■) being a concave function. With d24T >. the Lagrangian 
© can be rewritten as 

L(p,X 7 y) = f(p)-p Jr E T X-^(p-ij) T V(p-y)+X T R, (25) 

where E is a K x Ej=i l-^(j) I dimensional matrix with binary 
elements representing the relationship between the antennas 



ACCEPTED BY IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, FEB. 2013 



6 



and their transmit power variables, i.e., if the fcth antenna is 
selected to serve the nth user, one of the |i?(n)| elements on 
the fcth row and the (E"=i |-R(j)| + l)thto the (E" =1 \R(j)\)th 
columns is 1; otherwise, all of these |i?(n)| elements are 0. 
Moreover, it satisfies 

££al%)l K 

E ki = \U(k)\, Vi, J2 E ^ = 1 ' Vfc ' (26) 

i=l k=l 

because the fcth antenna serves \U(k)\ users and each trans- 
mit power variable belongs to only one antenna. V is a 
Sj=il-^(j')l x 2~2f=i\R(j)\ diagonal matrix with diagonal 
elements c n representing the parameters of the quadratic terms. 
R = [Pi, P2, Pk] T represents the vector of maximal 
transmission power of the antennas. Therefore, the Lagrangian 
maximization problems © and (0 can be expressed as 



and 



z(t) = arg max-L(p, X(t + 1), y(t)), 



(27) 



(28) 



respectively. Let A be a K x K diagonal matrix with diagonal 
elements ct k representing the step size for dual update. Let B 
be a Y^,jLi I x SjLi I diagonal matrix with diagonal 
elements /3 representing the step size for auxiliary update. 
Then, the dual update (0 and auxiliary update (0 can be 
rephrased as 



X(t + l) = [X(t) + A(Ep(t)-R)} + , 



(29) 



and 



y(t + 1) = y(t) + B(z(t) - y(t))- (30) 
We also need to define the stationary point of Algorithm 



A. 



Definition 1 A point (y*,X*) is a stationary point of Algo- 
rithm A, ;/ 



V 



arg max L(p, X* , y* 



Ey* - R < 0, A* > 0, 
A* <g> (Ey* — R) = 0, 



(31) 

(32) 
(33) 



where x ® y represents the Hadamard (elementwise) product 
of two vectors x and y with the same dimension. 

Let us further consider a Lagrangian maximization problem 
p = argmaxp'L(p, X,y). The KKT conditions suggest that 
there must exist a subgradient V/(-) of /(•) satisfying 



Vf(p) - E T X - V(p - y) = 0. 



(34) 



Similarly, let (y*, A*) denote a stationary point of Algorithm 
A, then we can get from OTb that 



V/(r ) - E T X* = 0. 



(35) 



Now we are ready to introduce the main result of this paper 
in the following theorem, i.e., the sufficient condition for the 
convergence of Algorithm A. 



Theorem 1 If the objective function f(p) is in the form of 
J2n fn(J2kPknlkn) with /„(■) being a concave function, and 
the step-size satisfies 

2min{„ e ,7( fe )} c„ 



0!fc < 



(36) 



3|C/(fc)| 

where \U(k) \ is the number of users served by the kth antenna, 
the proposed distributed power allocation Algorithm A will 
converge to a stationary point (y*,X*) of the algorithm, and 
p* = y* is an optimum solution. 

The proof of Theorem Q] relies on the following key result: 



Lemma 2 Let (pi, Ai) and (p2, A2) be two maximizers of the 
Lagrangian ( |25t for fixed auxiliary variable y, i.e., p\ = 
arg max L [p, y, Xi ^ and P2 — arg max Z (^p, y, A2^, and 

(y* , A*) is a stationary point of Algorithm A, then 



< 7 fA 2 - Xi) EV~ 1 E T (X 2 - X 



(37) 



where Vf(pi) and V/ (y*) are defined in i34i and (I35l >. 

Proof: See Appendix B. ■ 
With Lemma [2] we are able to prove Theorem 1 . The details 
are relegated to Appendix C. Some remarks about Theorem 1 
are provided as follows: 

Remark 2: If we choose = """Ji^/^)^ C " ■> then the 
step-size parameters a k do not rely on the channel coeffi- 
cients 7fc n . On the contrary, they are only determined by the 
number of users served by the fcth antenna, i.e., |f/(fc)|. Since 
the convergence speed of iterative optimization algorithms is 
mainly affected by the step-size, the convergence speed of our 
algorithm is quite robust to different values of jkn- 

Remark 3: It is worthwhile to note that the channel fading 
coefficients 7fc„ is involved in f(p) on the left hand side of 
(|37| |. by not in the right hand side of (l37T i. This is the key 
reason that the bound on the step-size a k in Theorem Q] is 
irrelevant to j kn . 

Remark 4: In |fl9l Lemma 3], the authors proved that 

[V/(pi)-V/(r)] T (P2-r) < \(X 1 -X 2 ) T EV~ 1 E T (Xi-X 2 ), 

(38) 



for the degenerated case of j k n = 1- One can see that 
is looser than the inequality (|37| | in Lemma [2] Moreover, in 
|fl9l Proposition 4], the authors only proved the convergence 
of their algorithm for the step-sizes a k 



2max {fce/C } \U(k)\ ' 

which are smaller than the step-sizes of our algorithm, i.e., 
a k = min ^u(k}\ )} C " • Therefore, our algorithm can achieve a 
faster speed of convergence than that of lfl9l. Some numerical 
results will be provided in the next section to illustrate this. 

Remark 5: Theorem[T]provides a sufficient condition for the 
convergence of Algorithm A, for all the system circumstances. 
According to our simulation experiences, there exist some 
circumstances that larger step-sizes than those of (l36l can 
also obtain an optimum solution to problem (0. However, it 
is difficult to prove that such weaker conditions ensure the 
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convergence of Algorithm A uniformly for all the system 
circumstances. 

Remark 6: If the channel gains change before convergence 
as in the slow fading environment, the resultant power allo- 
cation solution may not be optimum. However, Algorithm A 
is able to track the changes of the slow fading environment 
to some extent. For example, suppose that the channel gains 
change after the algorithm has reached a near optimum solu- 
tion. We can still use the dual variable and auxiliary variable of 
the last iteration as the initial state of the subsequent iterations. 
As long as the changes of channel gains are small, the dual 
variable and auxiliary variable of the last iteration is not far 
from the optimum solution, and the number of iterations for 
convergence is much smaller than using a random initial state. 

IV. Numerical Simulations 

In this section, we present some simulation results to 
demonstrate the efficiency of the proposed power allocation 
algorithm. We consider a downlink DAS with 7 cells. Each cell 
is equipped with 7 distributed antennas, including 1 antenna 
locating at the center of the cell and 6 remote antennas 
distributed near the boundary of the cell. Similar with Fig. [TJ 
the locations of these 49 antennas form a hexagonal lattice. 
The minimal distance between two neighboring antennas is 
D = 1000 meters. The users are distributed uniformly in 
the entire network area, with the extra constraint that the 
distance from a user to a nearest antenna is no smaller than 
10 meters. The wireless channel coefficients are composed by 
three components: large-scale path loss, shadowing, and small- 
scale Rayleigh fading. The path loss and shadowing are deter- 
mined by the SCM model for Urban Macro environments fl26) . 
Specifically, the path loss is given by PL = 34.5+35 \og 10 (d), 
where d is the distance in meters between the user and the 
antenna. The shadowing component satisfies a log-normal 
distribution with zero mean and a standard deviation of 8 
dB. For downlink CoMP transmissions, each user is served by 
\R(n) \ = 3 antennas, which are selected based on large-scale 
channel path loss. The maximal transmission power of each 
antenna is assumed to be the same, i.e., = P- The data rate 
weights are chosen as w n = 1. Two users are allowed to be 
scheduled on the same channel, if they are served by different 
antennas. The bandwidth of each receiver is 1MHz, and the 
noise figure of each receiver is 5 dB. The conservative noise- 
plus-interference power cr^ k is chosen to be 5 dB greater 
than the noise power. Therefore, the noise-plus-interference 
power at each receiver is peak = —174+60 + 5 + 5= —104 
dBm. We utilize C n in (f2]i to formulate the power allocation 
problem ©. After the power allocation solution is derived, 
we substitute it into the original rate function C n in ([U to 
compute the achievable data rate. All the simulation results 
are obtained by averaging over 1000 system realizations. 

We compare our proposed power allocation strategy for 
problem (01 with the following 2 reference strategies: The first 
strategy considers the optimal power allocation for downlink 
CoMP transmissions with no interference, which provides a 
performance upper bound of the practical scenarios with in- 
terference. The second one is a simple equal power allocation 
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Antenna TX power P(dBm) 



Fig. 2. Simulation results of per-user throughput versus transmission power 
P for N = 70. 

(EPA) strategy, where each antenna allocates its transmission 
power equally to serve its users. 

Figure [2] illustrates the simulation results of per-user 
throughput versus transmission power P for different power 
allocation strategies, where each cell has 10 users. 

Figure [3]presents the simulation results of per-user through- 
put versus the number of users per cell N/7, where the 
transmission power P = 20 dBm. One can observe that the 
proposed power allocation strategy has a small gap from the 
performance upper bound, especially when the transmission 
power P is small. However, the simple equal power allocation 
scheme has a lower throughput. The performance of equal 
power allocation is poor, because the wireless links from 
different antennas to one user have quite different channel 
quality. The base station should spend more power on the 
strong wireless links, instead of using the same power for 
different wireless links. Through careful user scheduling and 
setting reasonable threshold of noise amplification, the pro- 
posed algorithm can achieve performance approaching to that 
of the ideal non-interference scenario. Therefore, the proposed 
power allocation strategy plays an essential role to realize the 
benefits of downlink CoMP transmissions in DAS. 

Figure 2] illustrates the evolutions of the dual optimality gap 
of the proposed power allocation algorithm and the distributed 
optimization algorithm of |fl9l for N — 175 and P = 30dBm, 
where the dual optimality gap is given by L(p(t), A(t), y(t)) — 
f(y*)- The parameters of our distributed power allocation 
algorithm are chosen as c„ = 3, otk = 2min _t"eg>)} c " ^ an( j 
(3 = 1. The parameters of the algorithm of IU9l are given 
by c n = 3, ah — min {neAf} c " / see Remark 4) an( j 

3 " ' K 2max {fceK ;} \U(k)\ v 

j3 = 1. Since the step-sizes of our proposed power allocation 
algorithm are larger than the reference algorithm in [1191 , our 
algorithm exhibits a faster convergence speed. We note that 
this is the convergence speed when the algorithm is cold 
started. In practice, since the channel condition varies slowly, 
the power allocation solution from the previous run of the 
algorithm is an excellent initial state for warm-starting the 
algorithm. By this, the algorithm generally converges much 
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upper bound 
proposed algorithm 
EPA 




15 20 25 

Num of user per cell N/7 

Fig. 3. Simulation results of per-user throughput versus the number of users 
per cell N/7 for P = 20dBm. 




100 200 300 400 500 600 700 

Iteration number 

Fig. 4. Convergence of our proposed algorithm and the algorithm in 1191 
for N = 175 and P = 25dBm. 



faster. 



V. Conclusions 



We have proposed a distributed power allocation algorithm 
for downlink CoMP transmissions in DAS. We considered 
an approximate power allocation problem with a non-strictly 
concave objective function, which makes traditional duality- 
based optimization techniques not applicable for this problem. 
We have resolved this non-strict concavity issue by adding 
some quadratic terms to make the objective function strictly 
concave, and developed a distributed algorithm to solve the 
power allocation problem. A key merit of this algorithm 
is that its convergence speed is robust to different values 
of the channel coefficients. Its implementation only requires 
local information exchange among neighboring base stations 
serving the same user. The convergence and optimality of 
this algorithm has been established rigorously. Our simulation 
results have revealed that significant throughput improvements 
can be realized by this power allocation algorithm. 



Appendix A 
Proof of Lemma 1 

Since g(x) is a concave function of x, B(p) = 
g(Y,kPklk) - Yjki a kPk + b kPk + Cfe) is also concave with 
respect to p. The KKT conditions indicate 



dB 

dpk 



= 0, 



and 



dB 

dpk 



f=0 \ip%>0, 



(A.l) 



(A.2) 



By taking the weighted summation of the partial derivations 

bklk- 



we obtain 



Let s = J2kPk,oJk, ( IA.1I ) and ( IA.3b imply 



a k . 



9 



— n i- — * 



ak 



bklk 
ak 



0. 



(A.3) 



(A.4) 



k k 

For s* = J^k Pklk, dA.2b and dA.3b suggest 

k k 



(A.5) 



Comparing (I A. 4i and (IA.5t . we derive that 

[5'(s)-3V)]E?- 2 ( s - s *)^°- 



ak 



(A.6) 



If s - s* ^ 0, then 

V(s) -<?'(**) 



(s-s*) 



k ^ . 



> 0. 



(A.7) 



Since g(x) is a concave function, 



g'(s)-g'(s') 



< 0. Further, 



by the positivity of a/., we have s — s* < 0. 

Suppose there exists some k such that pk,o < and p* k > 0. 
Then, dATT i and dA2l imply 



dB 
dpk 



= 0, 



(A.8) 



Pk=v k 



dB 
dpk p 

which further suggests 

Pk.o = 7T- hkg'{s) - b k ] , p* k = — bkg'{s*) - b k ] ■ 

Zdk idk 

(A.9) 

Since s — s* < and g(x) is a concave function, we have 
g'( s ) > g'( s *)- Therefore, p* k < Pk,o, which contradicts with 
the assumption of pk o _ and p* k > 0. Therefore, if pk o < 0, 

p* k = 0. 
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Appendix B 
Proof of Lemma[2] 

We need to use the fact that f(p) is in the form of 
En fn(Y J kPknlkn), where /„(•) is a concave function. Equa- 
tion d34l i can be also written as 

V/„( Y Pknlkn)lkn-^k-Cn{Pkn~ykn) = 0, Vfc S R(n), 
k£R{n) 

(B.l) 

where V/„(-) is the subgradient of /„(•). By conducting a 
weighted summation of the equations in ( IB. 11 ), we obtain 

V/„( Y Pknlkn) Y ~ E Xklkn 

feefl(n) k£R(n) k£R{n) 

- C n Y Pknlkn + c n E Vknlkn = 0. (B.2) 
k£R(n) keR(n) 

Let us define (i = 1,2) 

a n ,i = V/„( ^ Pkn.ilkn) - V/„( J/fc„7fcn)»(B.3) 
keR(n) keR(n) 

b-n i = > VknAlkn - > J/L^fcn- (B.4) 



~ ^ ] Pkn.ilkn ^ ' Vknlkn- 
k k 



Then, ( IB. 2b indicates 

(a„.i - a„, 2 ) Z 7L - c„(6„4 - 6„ >2 ) 

fc£-R(n) 

= E (Afc ^ ~ A fe.2)7fcn, fc G iZ(n). (B.5) 

keR(n) 

Then, the formula on the left hand side of (l37l can be 
written as 

[V/(p*l) - Vf(f )f (P2 ~ V* ) = Yl a n,lK,2- (B.6) 



Since /«(•) is a concave function, we obtain 

a n ,l&n,l < 0, a n ^b n .2 < 0. 



(B.7) 



Cnbn 1 



Then 



C-nbnA 



On, 2 



, 1 



EfeGJi(n) 7fe 
c n b n ,2 



<- 



< 



EkeR(n) ll n 

Efc£fl(n)(Afc,l ~ ^k,2)jkn 
2 

2^ik£R(n) Ikn 
ibl.2+K,2j2k£R(n)(^k.l^^k,2hkr 
EkeR(n) 7fc 



Efc G R(n)( A M ~ Afc, 2 )7fc 



b n ,2 (by (E3) 



(by a„, 2 &n,2 < 0) 



(by completing the square) 

4 C « EfcS-R(n) 7fc„ 

<— ^ (A fc ,i - A M ) 2 (by Cauchy-Schwarz). (B.8) 



Therefore 

a n ,ib n<2 < — Y (Afc,i-A M ) 2 . (B.9) 

™ keR(n) 

The statement of Lemma [2] follows by substituting ( IB. 91 ) into 



Appendix C 
Proof of TheoremQ] 

Let us define the norm of dual and auxiliary variables: 

||A|U = A^- 1 *, ||yl|y = fVy, \\y\\ BV = y T B^Vy. 

Suppose that (y* , A* ) is a stationary point of Algorithm A, 
we will show that the Lyapunov function 

v(y(t),\(t)) = \\X(t) - A*|U + W) -S*\\bv (CI) 

is non-increasing in iteration number t. 
In |fl9l , it was shown that 

v(y(t + l),X(t + l))-v(y(t),Ht)) 
<-||A(* + l)-A(i)|U 
+ (X(t + 1) - X(t)) T EV- 1 E T (X(t + 1) - X(t)) 
- \\y(t) - P(t)\\v 

+ 2[Vf(z{t)) - VfmfW) - y*)- (C.2) 
Invoking Lemma [2] we have 

[V/(z(*)) - v/(r )f(p(t) - it) 
<\{X{t + 1) - X^fEV^E 7 ^ + 1) - A(i)). (C.3) 



Substituting ( IC.3b into ( IC.21 ). we obtain 

«(#+l),A(t + l))^(*) ! A(t)) 

< - (X(t + 1) - A(t)) T C7(A(t + 1) - X(t)) - \\y(t) - mt)\\ v , 

(C4) 

where C = A^ 1 — ^EV~ 1 E T . If C is non-negative definite, 
then the Lyapunov function v(y(t),X(t)) is non-increasing 
in iteration number t. Then, we can prove that (y* , A* ) is 
a stationary point of Algorithm A by using the standard 
Lyapunov drift arguments in |[T9l Prop. 4]. According to the 
standard duality theory, if (y*,X*) is a stationary point of 
Algorithm A, then p* — y* provides a solution to ([3). 

Finally, we need to show that C is a non-negative definite 
matrix, if (f36t is true. Let x be any vector of K dimensions, 
according to (1261) . we have 

x T A~ x x- -x T EV~ 1 E T x 
2 



K 



N 



Y a k lx l- oY J2 c « 1; 



k=l 
K 



n=l keR(n) 
K 



Y a k lx l- oY J2 c « : 



x k 



k=l 
K 



k=l n€U{k) 



feeiJ(n) 



r 2 



(C.5) 



fc=i 



neU{k) 
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By d36l l, we can obtain 



a k < 



This further suggests 



2 min. 



ne(7(fc) c n 



3\U(k)\ 



a fc 1 > 7r|E/(fc)| max c,/ > | n 1 . 



(C.6) 



(C7) 



neU(k) 



Substituting ( IC.7b into ( IC.5I ). we obtain that x 1 Ar x x - 
^x 1 EV~~ X E 7 x > for any x. Therefore, C is a non-negative 
definite matrix. 
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Abstract — This paper investigates the distributed power al- 
location problem for coordinated multipoint (CoMP) transmis- 
sions in distributed antenna systems (DAS). Traditional duality- 
based optimization techniques cannot be directly applied to 
this problem, because the non-strict concavity of the CoMP 
transmission's achievable rate with respect to the transmission 
power induces that the local power allocation subproblems have 
non-unique optimum solutions. We propose a distributed power 
allocation algorithm to resolve this non-strict concavity difficulty. 
This algorithm only requires local information exchange among 
neighboring base stations serving the same user, and is thus 
flexible with respect to network size and topology. The step- 
size parameters of this algorithm are determined by only local 
user access relationship (i.e., the number of users served by each 
antenna), but do not rely on channel coefficients. Therefore, the 
convergence speed of this algorithm is quite robust to channel 
fading. We rigorously prove that this algorithm converges to an 
optimum solution of the power allocation problem. Simulation 
results are presented to demonstrate the effectiveness of the 
proposed power allocation algorithm. 

Index Terms — Coordinated multipoint transmission, distribu- 
ted power allocation, distributed antenna system. 



I. Introduction 

The explosive growth of mobile access services has led 
to a huge demand for enhanced throughput and extended 
coverage in the next generation wireless networks. In recent 
years, distributed antenna system (DAS) has emerged as a 
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promising network architecture to achieve these goals [fl]-|[3]. 
In this architecture, each base station is equipped with some 
remote antennas which are distributed in the entire cell area, as 
shown in Fig. [TJ These distributed antennas are connected to 
the base station via wired backhaul network. By this, nearby 
distributed antennas are able to coordinate with each other 
and provide enhanced service experience to the mobile users. 
This technique is called the coordination multipoint (CoMP) 
transmission in the literature 0, H. 

One of the key techniques to realize high throughput in 
wireless networks is power allocation. Traditionally, power 
allocation of wireless networks is handled by centralized 
algorithms, e.g., JSJ — BSJ - These algorithms request multi-hop 
signaling mechanisms to gather the channel state informa- 
tion (CSI) of all the wireless links at a central processing 
unit in a short time period, and then distribute the obtained 
power allocation solution to the transmitters. Such mechanisms 
would generate enormous signaling overhead on the backhaul 
network, and is probably not scalable when the network size 
grows large. 

Recently, a great deal of research efforts have focused on 
distributed power allocation for various wireless networks. 
Game theory based power allocation techniques were pro- 
posed in j9l- lfT3l . which intend to compute Nash equilibrium 
power allocation solutions. However, these Nash equilibrium 
solutions might be far from optimality |[T2| . Duality -based 
distributed power allocation techniques were proposed in [ 14 j- 
|[T6l , where the global power allocation problem is decom- 
posed into many local power allocation subproblems, each of 
which can be solved by utilizing only locally available network 
information. However, these techniques cannot be directly 
applied to CoMP transmissions in DAS — the local power 
allocation subproblem may have many optimum solutions, 
because the data rate of CoMP transmission is not strictly 
concave with respect to the power variables 11141 . ifTTI . Since 
no global network information is available when solving the 
local power allocation subproblems, it is quite difficult to 
find a global feasible solution among all the locally optimum 
solutions. 

One promising method to address this non-strict concavity 
problem is the proximal point method [18], which adds strictly 
concave terms to the objective function without affecting the 
optimum solution. However, typical proximal point algorithms 
require a two-layer nested iteration structure, where each 
outer-layer update can proceed only after the inner-layer 
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Fig. 1. System model of coordination multipoint (CoMP) transmissions in 
a distributed antenna system (DAS). 



iterations converge |[T8l . Such a structure is not suitable for 
on-line distributed implementation, because it is difficult to 
decide in a distributed manner when the inner-layer iterations 
can stop. In |fl9l , a single-layer proximal point algorithm 
was proposed for multi-path routing problems. However, the 
convergence analysis in [|T9l also cannot be directly utilized 
for the power allocation problem considered here, owing to the 
additional channel coefficients in our problem. It is difficult to 
answer if the channel coefficients have significant impact on 
the convergence behavior of the algorithm mentioned above. 

This paper investigates the distributed power allocation 
problem for a downlink DAS with many antennas and many 
single antenna users. Each user is served by several nearby 
antennas via CoMP transmission techniques. Meanwhile, each 
antenna may serve several users over orthogonal channels. The 
main contributions of this paper are summarized as follows: 

1) A distributed power allocation algorithm is proposed to 
maximize the weighted sum rate of the downlink DAS, 
subject to per-antenna power consumption constraints. 
This power allocation algorithm is implemented distribut- 
edly among the base stations instead of being executed 
in a centralized fashion. The algorithm possesses a nice 
single-layer iteration structure, which is desirable for on- 
line implementations. In each iteration, the algorithm only 
requires local information exchange among neighboring 
base stations serving the same user, which is flexible with 
respect to network size and topology. 

2) A novel procedure is proposed to compute the primal op- 
timum solution of the local power allocation subproblem, 
which is simpler than that proposed in lfl9l . 

3) We rigorously analyze the convergence and optimality of 
the proposed distributed algorithm for the power alloca- 
tion problem. The bounds of the step-size parameters to 
ensure convergence are derived. 

4) We show that the step-size parameters of this algorithm 
are determined by only local user access relationship (i.e., 
the number of users served by each antenna), but do not 



rely on channel coefficients^! Therefore, the convergence 
speed of this algorithm is quite robust to channel fading. 

Our proposed power allocation algorithm is motivated by 
the work of |[T9l . However, our work differs from it in several 
respects. First, our analysis indicates that a larger step-size 
can be used for the algorithm in |fl9l , which can achieve 
a faster convergence speed. Second, while our problem has 
additional channel fading coefficients, we show that the step- 
size parameters and the convergence speed of our algorithm 
are robust to different channel fading coefficients. Finally, our 
procedure for solving the local power allocation problem is 
simpler than that proposed in |fl9l . 

For ease of later use, we define the following notations: Let 
\S\ denote the number of elements in set S, and let S/T denote 
the set S/T = {x\x € S, x ^ T}. The projection of a real 
number x on the set [0,oo) is defined as [x] + = max{a;,0}. 

The remaining parts of this paper are organized as follows: 
In Section [ill the system model and problem formulation are 
presented. Section [HI] presents the proposed power allocation 
algorithm and its distributed implementation. Simulation re- 
sults of the proposed power allocation strategy are presented 
in Section IV. Conclusions are drawn in Section V. 

II. System Model and Problem Formulation 

A. System Model 

We consider a downlink DAS with K distributed antennas 
and N single antenna mobile users, which are denoted by K, = 
{1,2, ■■■ ,K} and M = {1,2,- ■• ,N}, respectively. Each 
base station is equipped with several distributed antennas, as 
illustrated in Fig. Q] These distributed antennas are connected 
to the base station via wired backhaul network. The total 
throughput of this network is limited by the strong co-channel 
interference. By allowing several nearby antennas to transmit 
to one user in a coordinated fashion, the CoMP transmission 
techniques, such as space-time block coding or maximum 
ratio transmission Q, convert the strong interferences into 
useful signals and thereby significantly boost the total network 
throughput. The set of antennas serving the nth user is denoted 
by R(n) C /C, and the set of users communicating with the kth 
antenna is expressed as U(k) C A/". In practice, the number of 
serving antennas for each user, i.e., is usually small, 

due to the limitation of implementation complexity for CoMP 
transmissions. 

When the density of the distributed antennas is high, CoMP 
transmissions can not mitigate all the strong interferences, 
which results in some strong residual interferences. In iffOll . 
it was shown that orthogonal transmission is Pareto optimal 
for strong interference Gaussian channels. Therefore, the users 
with strong mutual interference should be scheduled to com- 
municate over orthogonal channels via frequency (or time) 
division multiple access, while geographically separated users 
with weak mutual interference are allowed to share the same 
channel resource. This scheduling task belongs to the type 
of timetabling problem, which is a classic problem in the 

'The channel coefficients are only utilized locally to solve the local power 
allocation subproblem. 
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computer science literature with many practical algorithms 
ED, E2- 

After selecting proper antennas for CoMP transmission and 
scheduling the users, there are only weak interferences in the 
network. We consider a slow fading wireless environment. Let 
hkn be the complex coefficient of the wireless channel from 
the fcth antenna to the nth user and pk n be the transmission 
power of the fcth antenna for serving the nth user. The data 
rate of CoMP transmission to the nth user is given by 



DAS: 



Cn = l0g 2 1 



0-2 + Z)(fc,m)e/(n) \hkn\ 2 Pkm J ' 



(1) 



where I(n) is the set of source antenna and serving user 
pair which may interfere the nth user, or more specifically, 
(fc, m) E I(n) represents that the fc-th source antenna serving 
the m-th user through the same serving channel of the nth 
user. 

There are two difficulties for utilizing the data rate function 
C„ to formulate the power allocation problem: First, it leads 
to a non-convex optimization problem that is NP-hard 11231 . 
for which one may not be able to find a solution that is both 
fast and global optimal even by centralized optimization. The 
design of a distributed optimization algorithm will be even 
more difficult, if not impossible. In order to reduce the solution 
complexity, we need to find an approximate rate function of 
C„ that is convex. Second, it can be quite difficult to attain 
the exact expression of the rate function C„. In practice, the 
number of interfering antennas is usually much larger than 
the number of source antennas. Although the receiver can 
get an accurate estimation of the channel gain \hkn\ 2 for 
each source antenna k E R{n), it may be too demanding 
to estimate the channel gain from the enormous interfering 
antennas, especially when the powers of the interference 
signals are weak. On the other hand, estimating the noise- 
plus-interference power t7 2 = (T 2 + Z)(fc, m )e/(n) \ h kn\ 2 Pkm 
is obviously much easier. For these reasons, we consider to 
utilize an upper bound of the noise-plus-interference power 



cr„, which is denoted by <r r 
rate function. Let jkn = \hkn 



to derive an approximate 
1 ' l a n peak denote the normalized 



channel gain from the fcth antenna to the nth user. Then, we 
derive a conservative rate function 



log 2 



^ Pknlkn 



< C r 



(2) 



The key benefit of the conservative rate function C„ is that 
it is convex and is computable without accurate knowledge of 
the channel gain \hk n \ 2 for the enormous interfering antennas, 
which resolves the two difficulties mentioned above. We will 
illustrate the rate loss for using this conservative rate function 
to formulate the power allocation problem in Section |IV] 



B. Problem Formulation 

The rest of this paper focuses on the following power 
allocation problem to maximize the weighted sum rate of the 



N 



s.t. 



max ^ W n l0g 2 1 + ^2 Pknlkn | 

n=l \ keR(n) 

Pkn<Pk, k = l,2,...,K, 

neU(k) 

p kn >0, k = l,2,...,K, neU(k), 

where w n > is the weight of the nth user's data rate and 
Pk is the maximal allowable transmission power of the fcth 
antenna. 

This problem is a convex optimization problem, which can 
be solved by standard centralized convex optimization algo- 
rithms such as the interior point method 11241 . However, these 
centralized algorithms are hard to be fulfilled in large-scale 
DAS, due to the heavy signaling overhead over the backhaul 
network. In contrast, duality-based optimization techniques 
lfT4l - lfT6l cannot be directly applied to this problem, either, be- 
cause they require the objective function to be strictly concave. 
However, the objective function in (J3j is not strictly concave 
with respect to the transmission power variables, since it is 
constant when the value of YlkeRM Pknlkn is fixed. If the 
duality-based optimization techniques Ifl4l - lfl6l are utilized, 
the decomposed local power allocation subproblem may have 
many locally optimum solutions at some special dual points. 
It is quite difficult to recover a global feasible solution among 
all the locally optimum solutions. When the dual variables are 
updated around these dual points, the primal power allocation 
variables keep oscillating and hardly converge (see lTl9l for 
more details). 

III. Distributed Power Allocation Algorithm 

In this section, we propose a power allocation algorithm to 
solve the problem (O, which is distributed among the base sta- 
tions instead of being centralized over the entire network. The 
key feature of this algorithm is that its step-size parameters 
and convergence speed are robust to different channel fading 
coefficients, which makes our algorithm quite convenient for 
practical implementations. The details are provided in the 
following subsections. 

A. Single-layer Distributed Power Allocation Algorithm 

To circumvent the aforementioned oscillation problem, we 
make use of the idea in the proximal point method lfl8l . 
which is to add some quadratic terms to the objective function 
and make it strictly concave in the primal variables. We 
reformulate the original power allocation problem (fj} as 



max 



N 

E 

n=l 



w n log 2 (l + 



k£R(n) 



Pknlkn) 



N 



E E -^(Pkn-Vkn) 2 
n=l k£R(n) 



(4) 



s.t. Yl Pkn<Pk, k = 1,2,..., K, (5) 

n£U(k) 

Pkn >0, fe = 1,2, n€U(k), 
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where we have introduced some quadratic auxiliary terms to 
make the objective function strictly concave with respect to 
the transmission power variables. Here, y kn is the auxiliary 
variable corresponding to p kn , c n > is the parameter of 
the quadratic terms. For notational convenience, let us use 
the |-R(n)| dimensional vector p n to denote the transmission 
power variables of the antennas serving the nth users, and 
the Yln=i \R( n )\ dimensional vector p = [p[ ,p% , • • • ,Pn] T 
to denote all the transmission power variables. Similarly, we 
define the \R{n) \ dimensional vector y n and the 
dimensional vector y = [iff \y% ', • • • ,y^] T as the auxiliary 
variable vectors corresponding to p n and p. It is known that 
the optimum value of the objective function in coincides 
with that in (0 1181 . In particular, if p* is the optimum solution 
to ©, then p = p*,y = p* solves ©. 

The standard proximal point method in general has a two- 
layer nested optimization structure: the inner layer iterations 
optimizing p for fixed y by a Lagrangian dual optimization 
method, and the outer layer iterations optimizing the auxiliary 
variable y. Such a layered structure is not suitable for on-line 
distributed implementations, because it is difficult to decide 
in a distributed manner when the inner-layer iterations have 
converged. In the following, we present a modified proximal 
point method with a single-layer optimization structure, where 
the outer-layer update of y does not request that the inner-layer 
dual updates have converged. 

The Lagrangian of the problem dU can be written as: 



L(p,\,y) 



N 

E 

n=l 



log 2 (l 



/ J Pknlkn) 
keR(n) 



K 



E Afc ( E Pkn ~ Pfc ) 
k=l n£U(k) 



N 



E E -J^Pkn - Vknf , 



(6) 



n=\ keR(n) 



where A = [A 1; A 2 , • ■ ■ , Xk] T is the vector of dual variables 
corresponding to the constraints in (|5}- Now we are able 
to present our distributed power allocation algorithm as the 
following: 

Algorithm A: Single-layer Distributed Power Allocation 
Algorithm 

At the ith iteration, 

Step 1: Dual variable update: 

Let y = y(t) and A = X(t), maximize L(p,X,y) with 

respect to p: 

p(t) = aigmax^ L(p, X(t),y(t)). (7) 
Update the dual variables by 

X k (t+l) = [X k (t)+a k ( ]T Pkn-Pk)} + , (8) 

n£U(k) 

where a k is the step-size of the dual update. 
Step 2: Auxiliary variable update: 

Let y = y(t) and A = X(t + 1), maximize L(p, A, y) with 
respect to p: 

z(t) = arg maxp> L(p, X(t + 1), y(t)). (9) 



Update the auxiliary variables by 

Vkn(t + 1) = Vkn{t) + P(z k n{t) ~ Vkn{t)), (10) 

where < f3 < 1 is the step-size for auxiliary variable 
update. 

The value of f3 can be chosen arbitrarily in (0,1]. The 
choices of a k to ensure convergence of Algorithm A will be 
discussed in Section [TH-CI We note that while the convergence 
analysis in Ifl9l apply for the degenerated case of ^j kn = 1, 
it is difficult to answer if practical channel coefficients ^ kn 
would have significant impact on the convergence behavior of 
the algorithm. One major contribution of this paper is to show 
that the step-size parameters <x k to ensure convergence are 
irrelevant of ^ kn (see Section IIII-Cb . Since the convergence 
speed of iterative optimization algorithms is mainly affected 
by the step-size, the convergence speed of our algorithm is 
quite robust to different values of j kn . We will also show 
that step-sizes larger than those of |fl9l can be utilized in our 
algorithm to achieve a faster convergence speed. 



B. Distributed Implementation of Algorithm A 

We proceed to explain how to implement Algorithm A in 
a distributed fashion. The Lagrangian maximization problems 
(0 and (O can be decomposed into many independent local 
power allocation subproblems for each user. Specifically, the 
terms of the Lagrangian © can be reassembled as 



Lip, X,y)=^2 



w, 



l0g 2 (l + ^ Pknlkn) ~ 2J ^kPkn 
keR(n) fcei?.(n) 

K 

^A fc P fe . (11) 



Vkn) 



k€R(n) 



k=l 



Therefore, the Lagrangian maximization problems (|7) and © 
can be rewritten as 



N K 

maxL(p,X,y n ) = V" max B n (p n , A, y n ) + } X k P k , (12) 

^r>n — ' r> >o * — * 
n=l y - fc=l 



p>0 

where 



B n {p n , X,iJ n ) =W„l0g 2 (l + 2_j Pknlkn)- ^ X k p kn 

keR(n) keR(n) 

~ E -^(.Pkn-Vkn) 2 - (13) 

Therefore, problems ([7) and (|9} can be decomposed into a 
series of local power allocation subproblems. 

In practice, the resource allocation of a user is carried out at 
a nearby base station. However, the antennas serving this user 
may belong to several base stations, as illustrated in Fig. Q] 
Therefore, neighboring base stations need to exchange infor- 
mation during the iterations of Algorithm A. The distributed 
implementation procedure of Algorithm A is described as 
follows: 

At the tth iteration of Algorithm A, the base station 
assigned to the nth user first utilizes the channel quality 
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information {lkn}keR(n) to solve a subproblem of (0, given 
by 

{Pkn(t)}keR(n) = Bxgmaxp n > Q B n (jp n , X(t),y n (t)), (14) 

and forwards the power allocation solutions {pkn(~t)} keRM 
to nearby base stations controlling the antennas k G R(n). 
Then, the base station controlling the fcth antenna utilizes the 
power allocation solutions {pkn(t)} n eu(k) to update the dual 
variable Afc(i + 1) according to dS), and sends Xk(t + 1) to the 
base station assigned to the nth user. Next, the base station 
assigned to the nth user solves a subproblem of ©, i.e., 

{zkn(t)}k£R( n ) = argmaXj ?n > .B„(p n , X(t + 1), y„(t)), 

(15) 

and utilizes the resultant solution {zk n {t)}keR(n) to update the 
auxiliary variables yk n (t + 1) according to ([Tol l. Therefore, 
Algorithm A can be implemented in a totally distributed 
fashion, and it only requires local exchange of the power 
allocation solution Pkn(t) and the dual variable Xk(t + 1) 
among neighboring base stations in each iteration. In addition, 
when the channel power gain -fkn changes, each user sends 
the updated channel power gain to its assigned base station. 

1 ) Solution to Local Power Allocation Subproblem (I141) : 
The optimum solution to ( fT~4-b satisfies the following Karush- 
Kuhn-Tucker (KKT) conditions (21: 

OB 

n w nlkn a 

dp kn In 2(1 + J2keR(n) Pknlkn) 

= 0, if p kn > 0; 
<0, ifp fc „ = 0, 



CnijPkn ~ ykn) 



Define 



fi(n) = {k G R(n)\p kn > 0} 



V k G R(n). 
(16) 

(17) 



as the set of antennas serving the nth user with positive power. 
Hence, pk n = for all k G R(n)/Cl(n). If O(n) is known, 
the KKT conditions in (TBI) indicate 

Wnlkn , / n n 

i on j ^ r - Afc - C n (p k n ~ ykn) = 0, 

ln2(l+^ feen(n) Pknlkn) 

V k e O(n). (18) 

By conducting a weighted summation of the equations in dT8l ), 
we obtain an equation of Efc G n(n) Pknlkn, i-e., 

Ea.-gJ2(^ 



T / IknXk ~ C„ IknPkn 



AGfi(n) 
fc£0(ra 



Aef2(n) 
7fen2/fcn = 0. (19) 



Let us define s„ = Efc £ n(n) Pknlkn, then dT9j can be 
reformulated as 



CnSn (Cn f^n)^n ~t~ 7ri 



0. 



(20) 



where 7 „ = E fee n(n) W »7L/ ln2 < M« = Efeenf„) W** - 
c n ykn)- The root s„ of the quadratic equation (|20b is given by 

s n = 5— [~(c„ + Mn) + V (c„ + ^„) 2 - 4c„(^„ - t~)]. 

(21) 



Substituting d2TT > into d!8t . we obtain the optimum solution to 
the subproblem ( fT4l as 



PA- 




A* 



if k G fl(n) 



ln2(l+s„) 

if fc G i?(n)/fi(n). 

(22) 

2) A Novel Procedure to Derive f2(n): Until now, the left 
task is to determine f2(n) in the optimum solution to (fT~4T >. Let 
us consider the unconstrained problem corresponding to ( [Pfl i. 
i.e., 

p n ,o = argmax5 n (p„, A, y„). (23) 



Our research indicates that if pkn.o in the solution to 
satisfies pkn.o < 0, then the solution to (fT~4T > must satisfy pfe„ = 
(i.e., k 51(n)). This statement is expressed in the following 
lemma: 

Lemma 1 Suppose that g(x) is a differentiable concave 
function on x G [0, oo), and B{p) is defined as B(p) = 

9(J2kPklk) - J2k( a kPk + b kPk + Cfc) With Cl k > 0. // 

{p* k } = argmaxp.> i?(p) and {p k ,o} = argmax^B(p7, then 
p* k = /or any k satisfying pk t a < 0. 

Proof: See Appendix A. ■ 
With Lemma Q] we are able to compute the optimum choice 
of fi(n). The detailed procedure is given as follows: 
P-l. Initialization: Set Q(n) = R(n). 

P-2. Compute s„ and pk n according to d2"T1 i and (l22l . respec- 
tively. 

P-3. If pfe„ > for all fc G Jl(n), output J7(n) and exit; 
otherwise, set Cl/ n ) = {k\pk n > 0, fc G i?(n)} and return 
to P-2. 

Remark 1: Lemma Q] allows us to rule out all the elements 
k with pkn < from D,(n) in one iteration. Therefore, the 
proposed procedure can converge much faster than the method 
proposed in |fl9l , l25l . which is to eliminate only one element 
k with the smallest negative p kn in each iteration. Our method 
significantly reduces the number of iterations to compute S7(n) 
and does not require sorting procedure of {p kn }- 

C. Convergence Analysis 

In this subsection, we obtain the bounds on the step-sizes 
at to ensure convergence. First, we need some notations 
and definitions to simplify the expressions of our theoretical 
analysis. Let us consider the function 

= f E^=l W n l0g 2 (l + J2keR(n) Pknlkn), if Pkn > 0, 



f(P) 

-oo, otherwise, 

(24) 

which has incorporated the power constraint p > in the 
definition. The analysis of this subsection applies to any 
objective function f(p) in the form of }~2 n /n(EfePfe«7fen) 
with /„ {■) being a concave function. With d24T >. the Lagrangian 
© can be rewritten as 

L(p,X 7 y) = f(p)-p Jr E T X-^(p-ij) T V(p-y)+X T R, (25) 

where E is a K x Ej=i l-^(j) I dimensional matrix with binary 
elements representing the relationship between the antennas 
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and their transmit power variables, i.e., if the fcth antenna is 
selected to serve the nth user, one of the |i?(n)| elements on 
the fcth row and the (E"=i |-R(j)| + l)thto the (E" =1 \R(j)\)th 
columns is 1; otherwise, all of these |i?(n)| elements are 0. 
Moreover, it satisfies 

££al%)l K 

E ki = \U(k)\, Vi, J2 E ^ = 1 ' Vfc ' (26) 

i=l k=l 

because the fcth antenna serves \U(k)\ users and each trans- 
mit power variable belongs to only one antenna. V is a 
Sj=il-^(j')l x 2~2f=i\R(j)\ diagonal matrix with diagonal 
elements c n representing the parameters of the quadratic terms. 
R = [Pi, P2, Pk] T represents the vector of maximal 
transmission power of the antennas. Therefore, the Lagrangian 
maximization problems © and (0 can be expressed as 



and 



z(t) = arg max-L(p, X(t + 1), y(t)), 



(27) 



(28) 



respectively. Let A be a K x K diagonal matrix with diagonal 
elements ct k representing the step size for dual update. Let B 
be a Y^,jLi I x SjLi I diagonal matrix with diagonal 
elements /3 representing the step size for auxiliary update. 
Then, the dual update (0 and auxiliary update (0 can be 
rephrased as 



X(t + l) = [X(t) + A(Ep(t)-R)} + , 



(29) 



and 



y(t + 1) = y(t) + B(z(t) - y(t))- (30) 
We also need to define the stationary point of Algorithm 



A. 



Definition 1 A point (y*,X*) is a stationary point of Algo- 
rithm A, ;/ 



V 



arg max L(p, X* , y* 



Ey* - R < 0, A* > 0, 
A* <g> (Ey* — R) = 0, 



(31) 

(32) 
(33) 



where x ® y represents the Hadamard (elementwise) product 
of two vectors x and y with the same dimension. 

Let us further consider a Lagrangian maximization problem 
p = argmaxp'L(p, X,y). The KKT conditions suggest that 
there must exist a subgradient V/(-) of /(•) satisfying 



Vf(p) - E T X - V(p - y) = 0. 



(34) 



Similarly, let (y*, A*) denote a stationary point of Algorithm 
A, then we can get from OTb that 



V/(r ) - E T X* = 0. 



(35) 



Now we are ready to introduce the main result of this paper 
in the following theorem, i.e., the sufficient condition for the 
convergence of Algorithm A. 



Theorem 1 If the objective function f(p) is in the form of 
J2n fn(J2kPknlkn) with /„(■) being a concave function, and 
the step-size satisfies 

2min{„ e ,7( fe )} c„ 



0!fc < 



(36) 



3|C/(fc)| 

where \U(k) \ is the number of users served by the kth antenna, 
the proposed distributed power allocation Algorithm A will 
converge to a stationary point (y*,X*) of the algorithm, and 
p* = y* is an optimum solution. 

The proof of Theorem Q] relies on the following key result: 



Lemma 2 Let (pi, Ai) and (p2, A2) be two maximizers of the 
Lagrangian ( |25t for fixed auxiliary variable y, i.e., p\ = 
arg max L [p, y, Xi ^ and P2 — arg max Z (^p, y, A2^, and 

(y* , A*) is a stationary point of Algorithm A, then 



< 7 fA 2 - Xi) EV~ 1 E T (X 2 - X 



(37) 



where Vf(pi) and V/ (y*) are defined in i34i and (I35l >. 

Proof: See Appendix B. ■ 
With Lemma [2] we are able to prove Theorem 1 . The details 
are relegated to Appendix C. Some remarks about Theorem 1 
are provided as follows: 

Remark 2: If we choose = """Ji^/^)^ C " ■> then the 
step-size parameters a k do not rely on the channel coeffi- 
cients 7fc n . On the contrary, they are only determined by the 
number of users served by the fcth antenna, i.e., |f/(fc)|. Since 
the convergence speed of iterative optimization algorithms is 
mainly affected by the step-size, the convergence speed of our 
algorithm is quite robust to different values of jkn- 

Remark 3: It is worthwhile to note that the channel fading 
coefficients 7fc„ is involved in f(p) on the left hand side of 
(|37| |. by not in the right hand side of (l37T i. This is the key 
reason that the bound on the step-size a k in Theorem Q] is 
irrelevant to j kn . 

Remark 4: In |fl9l Lemma 3], the authors proved that 

[V/(pi)-V/(r)] T (P2-r) < \(X 1 -X 2 ) T EV~ 1 E T (Xi-X 2 ), 

(38) 



for the degenerated case of j k n = 1- One can see that 
is looser than the inequality (|37| | in Lemma [2] Moreover, in 
|fl9l Proposition 4], the authors only proved the convergence 
of their algorithm for the step-sizes a k 



2max {fce/C } \U(k)\ ' 

which are smaller than the step-sizes of our algorithm, i.e., 
a k = min ^u(k}\ )} C " • Therefore, our algorithm can achieve a 
faster speed of convergence than that of lfl9l. Some numerical 
results will be provided in the next section to illustrate this. 

Remark 5: Theorem[T]provides a sufficient condition for the 
convergence of Algorithm A, for all the system circumstances. 
According to our simulation experiences, there exist some 
circumstances that larger step-sizes than those of (l36l can 
also obtain an optimum solution to problem (0. However, it 
is difficult to prove that such weaker conditions ensure the 
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convergence of Algorithm A uniformly for all the system 
circumstances. 

Remark 6: If the channel gains change before convergence 
as in the slow fading environment, the resultant power allo- 
cation solution may not be optimum. However, Algorithm A 
is able to track the changes of the slow fading environment 
to some extent. For example, suppose that the channel gains 
change after the algorithm has reached a near optimum solu- 
tion. We can still use the dual variable and auxiliary variable of 
the last iteration as the initial state of the subsequent iterations. 
As long as the changes of channel gains are small, the dual 
variable and auxiliary variable of the last iteration is not far 
from the optimum solution, and the number of iterations for 
convergence is much smaller than using a random initial state. 

IV. Numerical Simulations 

In this section, we present some simulation results to 
demonstrate the efficiency of the proposed power allocation 
algorithm. We consider a downlink DAS with 7 cells. Each cell 
is equipped with 7 distributed antennas, including 1 antenna 
locating at the center of the cell and 6 remote antennas 
distributed near the boundary of the cell. Similar with Fig. [TJ 
the locations of these 49 antennas form a hexagonal lattice. 
The minimal distance between two neighboring antennas is 
D = 1000 meters. The users are distributed uniformly in 
the entire network area, with the extra constraint that the 
distance from a user to a nearest antenna is no smaller than 
10 meters. The wireless channel coefficients are composed by 
three components: large-scale path loss, shadowing, and small- 
scale Rayleigh fading. The path loss and shadowing are deter- 
mined by the SCM model for Urban Macro environments fl26) . 
Specifically, the path loss is given by PL = 34.5+35 \og 10 (d), 
where d is the distance in meters between the user and the 
antenna. The shadowing component satisfies a log-normal 
distribution with zero mean and a standard deviation of 8 
dB. For downlink CoMP transmissions, each user is served by 
\R(n) \ = 3 antennas, which are selected based on large-scale 
channel path loss. The maximal transmission power of each 
antenna is assumed to be the same, i.e., = P- The data rate 
weights are chosen as w n = 1. Two users are allowed to be 
scheduled on the same channel, if they are served by different 
antennas. The bandwidth of each receiver is 1MHz, and the 
noise figure of each receiver is 5 dB. The conservative noise- 
plus-interference power cr^ k is chosen to be 5 dB greater 
than the noise power. Therefore, the noise-plus-interference 
power at each receiver is peak = —174+60 + 5 + 5= —104 
dBm. We utilize C n in (f2]i to formulate the power allocation 
problem ©. After the power allocation solution is derived, 
we substitute it into the original rate function C n in ([U to 
compute the achievable data rate. All the simulation results 
are obtained by averaging over 1000 system realizations. 

We compare our proposed power allocation strategy for 
problem (01 with the following 2 reference strategies: The first 
strategy considers the optimal power allocation for downlink 
CoMP transmissions with no interference, which provides a 
performance upper bound of the practical scenarios with in- 
terference. The second one is a simple equal power allocation 
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Antenna TX power P(dBm) 



Fig. 2. Simulation results of per-user throughput versus transmission power 
P for N = 70. 

(EPA) strategy, where each antenna allocates its transmission 
power equally to serve its users. 

Figure [2] illustrates the simulation results of per-user 
throughput versus transmission power P for different power 
allocation strategies, where each cell has 10 users. 

Figure [3]presents the simulation results of per-user through- 
put versus the number of users per cell N/7, where the 
transmission power P = 20 dBm. One can observe that the 
proposed power allocation strategy has a small gap from the 
performance upper bound, especially when the transmission 
power P is small. However, the simple equal power allocation 
scheme has a lower throughput. The performance of equal 
power allocation is poor, because the wireless links from 
different antennas to one user have quite different channel 
quality. The base station should spend more power on the 
strong wireless links, instead of using the same power for 
different wireless links. Through careful user scheduling and 
setting reasonable threshold of noise amplification, the pro- 
posed algorithm can achieve performance approaching to that 
of the ideal non-interference scenario. Therefore, the proposed 
power allocation strategy plays an essential role to realize the 
benefits of downlink CoMP transmissions in DAS. 

Figure 2] illustrates the evolutions of the dual optimality gap 
of the proposed power allocation algorithm and the distributed 
optimization algorithm of |fl9l for N — 175 and P = 30dBm, 
where the dual optimality gap is given by L(p(t), A(t), y(t)) — 
f(y*)- The parameters of our distributed power allocation 
algorithm are chosen as c„ = 3, otk = 2min _t"eg>)} c " ^ an( j 
(3 = 1. The parameters of the algorithm of IU9l are given 
by c n = 3, ah — min {neAf} c " / see Remark 4) an( j 

3 " ' K 2max {fceK ;} \U(k)\ v 

j3 = 1. Since the step-sizes of our proposed power allocation 
algorithm are larger than the reference algorithm in [1191 , our 
algorithm exhibits a faster convergence speed. We note that 
this is the convergence speed when the algorithm is cold 
started. In practice, since the channel condition varies slowly, 
the power allocation solution from the previous run of the 
algorithm is an excellent initial state for warm-starting the 
algorithm. By this, the algorithm generally converges much 
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upper bound 
proposed algorithm 
EPA 




15 20 25 

Num of user per cell N/7 

Fig. 3. Simulation results of per-user throughput versus the number of users 
per cell N/7 for P = 20dBm. 




100 200 300 400 500 600 700 

Iteration number 

Fig. 4. Convergence of our proposed algorithm and the algorithm in 1191 
for N = 175 and P = 25dBm. 



faster. 



V. Conclusions 



We have proposed a distributed power allocation algorithm 
for downlink CoMP transmissions in DAS. We considered 
an approximate power allocation problem with a non-strictly 
concave objective function, which makes traditional duality- 
based optimization techniques not applicable for this problem. 
We have resolved this non-strict concavity issue by adding 
some quadratic terms to make the objective function strictly 
concave, and developed a distributed algorithm to solve the 
power allocation problem. A key merit of this algorithm 
is that its convergence speed is robust to different values 
of the channel coefficients. Its implementation only requires 
local information exchange among neighboring base stations 
serving the same user. The convergence and optimality of 
this algorithm has been established rigorously. Our simulation 
results have revealed that significant throughput improvements 
can be realized by this power allocation algorithm. 



Appendix A 
Proof of Lemma 1 

Since g(x) is a concave function of x, B(p) = 
g(Y,kPklk) - Yjki a kPk + b kPk + Cfe) is also concave with 
respect to p. The KKT conditions indicate 



dB 

dpk 



= 0, 



and 



dB 

dpk 



f=0 \ip%>0, 



(A.l) 



(A.2) 



By taking the weighted summation of the partial derivations 

bklk- 



we obtain 



Let s = J2kPk,oJk, ( IA.1I ) and ( IA.3b imply 



a k . 
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— n i- — * 



ak 



bklk 
ak 



0. 



(A.3) 



(A.4) 



k k 

For s* = J^k Pklk, dA.2b and dA.3b suggest 

k k 



(A.5) 



Comparing (I A. 4i and (IA.5t . we derive that 

[5'(s)-3V)]E?- 2 ( s - s *)^°- 



ak 



(A.6) 



If s - s* ^ 0, then 

V(s) -<?'(**) 



(s-s*) 



k ^ . 



> 0. 



(A.7) 



Since g(x) is a concave function, 



g'(s)-g'(s') 



< 0. Further, 



by the positivity of a/., we have s — s* < 0. 

Suppose there exists some k such that pk,o < and p* k > 0. 
Then, dATT i and dA2l imply 



dB 
dpk 



= 0, 



(A.8) 



Pk=v k 



dB 
dpk p 

which further suggests 

Pk.o = 7T- hkg'{s) - b k ] , p* k = — bkg'{s*) - b k ] ■ 

Zdk idk 

(A.9) 

Since s — s* < and g(x) is a concave function, we have 
g'( s ) > g'( s *)- Therefore, p* k < Pk,o, which contradicts with 
the assumption of pk o _ and p* k > 0. Therefore, if pk o < 0, 

p* k = 0. 
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Appendix B 
Proof of Lemma[2] 

We need to use the fact that f(p) is in the form of 
En fn(Y J kPknlkn), where /„(•) is a concave function. Equa- 
tion d34l i can be also written as 

V/„( Y Pknlkn)lkn-^k-Cn{Pkn~ykn) = 0, Vfc S R(n), 
k£R{n) 

(B.l) 

where V/„(-) is the subgradient of /„(•). By conducting a 
weighted summation of the equations in ( IB. 11 ), we obtain 

V/„( Y Pknlkn) Y ~ E Xklkn 

feefl(n) k£R(n) k£R{n) 

- C n Y Pknlkn + c n E Vknlkn = 0. (B.2) 
k£R(n) keR(n) 

Let us define (i = 1,2) 

a n ,i = V/„( ^ Pkn.ilkn) - V/„( J/fc„7fcn)»(B.3) 
keR(n) keR(n) 

b-n i = > VknAlkn - > J/L^fcn- (B.4) 



~ ^ ] Pkn.ilkn ^ ' Vknlkn- 
k k 



Then, ( IB. 2b indicates 

(a„.i - a„, 2 ) Z 7L - c„(6„4 - 6„ >2 ) 

fc£-R(n) 

= E (Afc ^ ~ A fe.2)7fcn, fc G iZ(n). (B.5) 

keR(n) 

Then, the formula on the left hand side of (l37l can be 
written as 

[V/(p*l) - Vf(f )f (P2 ~ V* ) = Yl a n,lK,2- (B.6) 



Since /«(•) is a concave function, we obtain 

a n ,l&n,l < 0, a n ^b n .2 < 0. 



(B.7) 



Cnbn 1 



Then 



C-nbnA 



On, 2 



, 1 



EfeGJi(n) 7fe 
c n b n ,2 



<- 



< 



EkeR(n) ll n 

Efc£fl(n)(Afc,l ~ ^k,2)jkn 
2 

2^ik£R(n) Ikn 
ibl.2+K,2j2k£R(n)(^k.l^^k,2hkr 
EkeR(n) 7fc 



Efc G R(n)( A M ~ Afc, 2 )7fc 



b n ,2 (by (E3) 



(by a„, 2 &n,2 < 0) 



(by completing the square) 

4 C « EfcS-R(n) 7fc„ 

<— ^ (A fc ,i - A M ) 2 (by Cauchy-Schwarz). (B.8) 



Therefore 

a n ,ib n<2 < — Y (Afc,i-A M ) 2 . (B.9) 

™ keR(n) 

The statement of Lemma [2] follows by substituting ( IB. 91 ) into 



Appendix C 
Proof of TheoremQ] 

Let us define the norm of dual and auxiliary variables: 

||A|U = A^- 1 *, ||yl|y = fVy, \\y\\ BV = y T B^Vy. 

Suppose that (y* , A* ) is a stationary point of Algorithm A, 
we will show that the Lyapunov function 

v(y(t),\(t)) = \\X(t) - A*|U + W) -S*\\bv (CI) 

is non-increasing in iteration number t. 
In |fl9l , it was shown that 

v(y(t + l),X(t + l))-v(y(t),Ht)) 
<-||A(* + l)-A(i)|U 
+ (X(t + 1) - X(t)) T EV- 1 E T (X(t + 1) - X(t)) 
- \\y(t) - P(t)\\v 

+ 2[Vf(z{t)) - VfmfW) - y*)- (C.2) 
Invoking Lemma [2] we have 

[V/(z(*)) - v/(r )f(p(t) - it) 
<\{X{t + 1) - X^fEV^E 7 ^ + 1) - A(i)). (C.3) 



Substituting ( IC.3b into ( IC.21 ). we obtain 

«(#+l),A(t + l))^(*) ! A(t)) 

< - (X(t + 1) - A(t)) T C7(A(t + 1) - X(t)) - \\y(t) - mt)\\ v , 

(C4) 

where C = A^ 1 — ^EV~ 1 E T . If C is non-negative definite, 
then the Lyapunov function v(y(t),X(t)) is non-increasing 
in iteration number t. Then, we can prove that (y* , A* ) is 
a stationary point of Algorithm A by using the standard 
Lyapunov drift arguments in |[T9l Prop. 4]. According to the 
standard duality theory, if (y*,X*) is a stationary point of 
Algorithm A, then p* — y* provides a solution to ([3). 

Finally, we need to show that C is a non-negative definite 
matrix, if (f36t is true. Let x be any vector of K dimensions, 
according to (1261) . we have 

x T A~ x x- -x T EV~ 1 E T x 
2 



K 



N 



Y a k lx l- oY J2 c « 1; 



k=l 
K 



n=l keR(n) 
K 



Y a k lx l- oY J2 c « : 



x k 



k=l 
K 



k=l n€U{k) 



feeiJ(n) 



r 2 



(C.5) 



fc=i 



neU{k) 
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By d36] l, we can obtain 



a k < 



This further suggests 



2 min. 



ne(7(fc) c n 



3\U(k)\ 



a fc 1 > 7r|E/(fc)| max c,/ > | n 1 . 



(C.6) 



(C7) 



neU(k) 



Substituting ( IC.7b into ( IC.5I ). we obtain that x T A~ 1 x - 
^x 1 EV~~ X E 7 x > for any x. Therefore, C is a non-negative 
definite matrix. 
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